rank | frequency | n-gram |
---|---|---|
1 | 5002 | -s |
2 | 4467 | -o |
3 | 3887 | -a |
4 | 2008 | -e |
5 | 1270 | -r |
rank | frequency | n-gram |
---|---|---|
1 | 1799 | -os |
2 | 1456 | -as |
3 | 1116 | -es |
4 | 1014 | -do |
5 | 838 | -ão |
rank | frequency | n-gram |
---|---|---|
1 | 568 | -ção |
2 | 479 | -dos |
3 | 478 | -ado |
4 | 473 | -nte |
5 | 373 | -ada |
rank | frequency | n-gram |
---|---|---|
1 | 403 | -ação |
2 | 359 | -ente |
3 | 338 | -ados |
4 | 232 | -adas |
5 | 224 | -ento |
rank | frequency | n-gram |
---|---|---|
1 | 241 | -mente |
2 | 203 | -mento |
3 | 151 | -idade |
4 | 148 | -½ï¿½o |
5 | 111 | -ações |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings